MICA: A fast short-read aligner that takes full advantage of Intel Many Integrated Core Architecture (MIC)
نویسندگان
چکیده
Background: Short-read aligners have recently gained a lot of speed by exploiting the massive parallelism of GPU. An uprising alternative to GPU is Intel MIC; supercomputers like Tianhe-2, currently top of TOP500, is built with 48,000 MIC boards to offer ~55 PFLOPS. The CPU-like architecture of MIC allows CPU-based software to be parallelized easily; however, the performance is often inferior to GPU counterparts as an MIC board contains only ~60 cores (while a GPU board typically has over a thousand cores). Results: To better utilize MIC-enabled computers for NGS data analysis, we developed a new short-read aligner MICA that is optimized in view of MIC’s limitation and the extra parallelism inside each MIC core. Experiments on aligning 150bp paired-end reads show that MICA using one MIC board is 4.9 times faster than the BWA-MEM (using 6-core of a top-end CPU), and slightly faster than SOAP3-dp (using a GPU). Furthermore, MICA’s simplicity allows very efficient scale-up when multiple MIC boards are used in a node (3 cards give a 14.1-fold speedup over BWA-MEM). Summary: MICA can be readily used by MIC-enabled supercomputers for production purpose. We have tested MICA on Tianhe-2 with 90 WGS samples (17.47 Tera-bases), which can be aligned in an hour less than 400 nodes. MICA has impressive performance even though the current MIC is at its initial stage of development (the next generation of MIC has been announced to release in late 2014). Availability and Implementation: MICA is under BSD (3-Clause) and freely available at http://sourceforge.net/projects/mica-aligner Contact: [email protected], [email protected] Supplementary information: Supplementary information is available at Bioinformatics online.
منابع مشابه
Intra-MIC MPI Communication using MVAPICH2: Early Experience
Knights Ferry (KNF) is the first instantiation of the Many Integrated Core (MIC) architecture from Intel. It is a development platform that is enabling scientific application and library developers to prepare for the upcoming products based on the MIC architecture. Intel MIC architecture, while providing the compute potential of a many-core accelerator, has the key advantage of supporting the e...
متن کاملVisual Exploration of Data with Multithread MIC Computer Architectures
Knowledge mining from immense datasets requires fast, reliable and affordable tools for their visual and interactive exploration. Multidimensional scaling (MDS) is a good candidate for embedding of high-dimensional data into visually perceived 2-D and 3-D spaces. We focus here on the way to increase the computational performance of MDS in the context of interactive, hierarchical, visualization ...
متن کاملCalculation of Stochastic Heating and Emissivity of Cosmic Dust Grains with Optimization for the Intel Many Integrated Core Architecture
Cosmic dust particles effectively attenuate starlight. Their absorption of starlight produces emission spectra from the nearto far-infrared, which depends on the sizes and properties of the dust grains, and spectrum of the heating radiation field. The nearto mid-infrared is dominated by the emissions by very small grains. Modeling the absorption of starlight by these particles is, however, comp...
متن کاملAsian Option Pricing on Intel® MIC Architecture
In this paper, we discuss the problem of pricing one exotic option, the strong path dependent Asian option using the Black–Scholes model and compare how the pricing algorithm can be implemented on Intel® Many Integrated Core or MIC Architecture and achieve impressive performance gains. We can demonstrate that a 2-year contract with 252 times steps and 1,000,000 samples can be priced in approxim...
متن کاملMulti-Kepler GPU vs. multi-Intel MIC for spin systems simulations
We present and compare the performances of two many-core architectures: the Nvidia Kepler and the Intel MIC both in a single system and in cluster configuration for the simulation of spin systems. As a benchmark we consider the time required to update a single spin of the 3D Heisenberg spin glass model by using the Over-relaxation algorithm. We present data also for a traditional high-end multi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- CoRR
دوره abs/1402.4876 شماره
صفحات -
تاریخ انتشار 2014